High-level approaches to confidence estimation in speech recognition
Similar resources
High-level approaches to confidence estimation in speech recognition
We describe some high-level approaches to estimating confidence scores for the words output by a speech recognizer. By “high-level” we mean that the proposed measures do not rely on decoder specific “side information” and so should find more general applicability than measures that have been developed for specific recognizers. Our main approach is to attempt to decouple the language modeling an...
A high-level approach to confidence estimation in speech recognition
Errors in the output of a speech recogniser can be said to be due to the interaction of inadequate phonetic and language modelling components. We investigate an approach to estimating confidence scores for the words output by a recogniser in which the language modelling and acoustic modelling are decoupled by the use of a phone recogniser working in parallel with the word recogniser. An advanta...
Confidence Estimation for Automatic Speech Recognition Hypotheses
Automatic speech recognition (ASR) systems produce transcriptions for audio which sometimes contain errors. It is useful to know how much confidence may be placed in this output being correct. Confidence estimation is concerned with obtaining scores which quantify this level of confidence. The development and application of a principled, flexible framework using conditional random field (CRF) models f...
Meta-models for confidence estimation in speech recognition
We describe an approach to confidence estimation that attempts to decouple the contributions of the acoustic and language model components to speech recognition output. The output of the acoustic models when decoding phonemes is itself modelled using HMM’s to produce a set of models which we term meta-models. When benchmarked against a “standard” method for assigning confidence (the N-best scor...
Stream confidence estimation for audio-visual speech recognition
We investigate the use of single modality confidence measures as a means of estimating adaptive, local weights for improved audio-visual automatic speech recognition. We limit our work to the toy problem of audio-visual phonetic classification by means of a two-stream Gaussian mixture model (GMM), where each stream models the class conditional audio- or visual-only observation probability, raised...
Journal
Journal title: IEEE Transactions on Speech and Audio Processing
Year: 2002
ISSN: 1063-6676
DOI: 10.1109/tsa.2002.804304